# Fine-grained alignment
Fg Clip Large
Apache-2.0
FG-CLIP is a fine-grained vision and text alignment model that achieves global and region-level image-text alignment through two-stage training, enhancing fine-grained visual understanding ability.
Multimodal Alignment
Transformers English

F
qihoo360
538
3
Wspalign Xlm Base
WSPAlign is a weakly supervised large-scale span prediction-based word alignment pre-training model that supports word alignment tasks for multiple language pairs.
Machine Translation
Transformers Supports Multiple Languages

W
qiyuw
77
0
Featured Recommended AI Models